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^j. Abstract 

This paper studies price-based spectrum access control in cognitive radio networks, which character- 

izes network operators' service provisions to delay-sensitive secondary users (SUs) via pricing strategies. 

ry$ Based on the two paradigms of shared-use and exclusive-use dynamic spectrum access (DSA), we 

O 

i— i examine three network scenarios corresponding to three types of secondary markets. In the first monopoly 

i— I market with one operator using opportunistic shared-use DSA, we study the operator's pricing effect on 

> 

the equilibrium behaviors of self-optimizing SUs in a queueing system. We provide a queueing delay 

00 

analysis with the general distributions of the SU service time and PU traffic using the renewal theory. In 
terms of SUs, we show that there exists a unique Nash equilibrium in a non-cooperative game where SUs 
are players employing individual optimal strategies. We also provide a sufficient condition and iterative 
algorithms for equilibrium convergence. In terms of operators, two pricing mechanisms are proposed 
with different goals: revenue maximization and social welfare maximization. In the second monopoly 

• i-H 

market, an operator exploiting exclusive-use DSA has many channels that will be allocated separately to 
each entering SU. We also analyze the pricing effect on the equilibrium behaviors of the SUs and the 
revenue-optimal and socially-optimal pricing strategies of the operator in this market. In the third duopoly 
market, we study a price competition between two operators employing shared-use and exclusive-use 
DSA, respectively, as a two-stage Stackelberg game. Using a backward induction method, we show that 
there exists a unique equilibrium for this game and investigate the equilibrium convergence. 

N. H. Tran, C. S. Hong and S. Lee are with the Department of Computer Engineering, Kyung Hee University, Korea (email: 
{nguyenth, cshong, drsungwon} @khu.ac.kr). 

Z. Han is with the Electrical and Computer Engineering Department, University of Houston, Houston, USA (email: 
zhan2@uh.edu). 



December 17, 2012 



DRAFT 



2 



I. Introduction 

The concept of dynamic spectrum access (DSA) has been emerging as a new approach for efficient 
utilization of scarce wireless spectrum that is conventionally controlled via static licensing. DSA enables 
secondary (unlicensed) users (SUs) to flexibly access underutilized legacy spectrum bands that are used 
sporadically by primary (licensed) users (PUs) (TJ. This idea of reusing the legacy spectrum is receiving 
great supports thanks to the rapid development of software-defined radio and cognitive radio (CR) 
technologies [2], which can vary parameters such as frequency, power, and modulation schemes through 
software. Most DSA approaches (3J, Q are connected to a three-tier model of the dynamic spectrum 
market in Fig. [I] which includes three network entities: spectrum owners, secondary network operator^] 
and CR-enabled SUs. The spectrum owner can temporarily lease their spectrum to the operators through 
periodic auctions. In each period, the winning operators can provide secondary services for SUs via their 
leased spectrum by charging an admission price to SUs for an economic return. 

Among the various DSA approaches, opportunistic shared-use (i.e. spectrum overlay) and dynamic 
exclusive-use models have been widely considered |3j-||5j. Opportunistic shared-use allows spectrum 
owners employ "interruptible leasing" to operators, a model that forces operators to provide secondary 
services without harming the operations of PUs on the leased spectrum. Dynamic exclusive-use, on 
the other hand, allows spectrum owners to dynamically transfer spectrum-usage rights to operators, a 
model that enables operators to lease parts of temporarily unused spectrum (i.e. no PUs operations) from 
spectrum owners for secondary services provision. 

In this paper, we study pricing-based spectrum access control of SUs by operators, which is shown 
in the shaded region of Fig. [T] We consider two kinds of operators that employ the two different DSA 
approaches of opportunistic shared-use and dynamic exclusive-usd^ respectively. Their service provision 
to SUs are controlled through pricing-based methods and are considered in three secondary-market types: 
shared-use monopoly, exclusive-use monopoly, and duopoly pricing competition. 

In the first monopoly market, by considering delay-sensitive SUs that share a PU's single channel 
controlled by a shared-use operator, we examine the effect of the operator's pricing on the equilibrium 
behaviors of non-cooperative SUs. This behavior is represented by a SU's spectrum access decision upon 
arrival, which entails either joining the list of other SUs that also want to share the same channel or 
balking. In terms of SUs, we first introduce an individual optimal strategy employed by each SU as a 

'For brevity, henceforth we will use only "operators". 

2 For brevity, henceforth we will use only "shared-use" and "exclusive-use". 
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Fig. 1. Three-tier dynamic spectrum market. 



player in a non-cooperative game in order to make its spectrum access decision, based on a utility function 
that captures the delay-sensitivity heterogeneity of SUs. We next show that there exists a unique Nash 
equilibrium in this game. In order to employ the individual optimal strategy, each SU has to evaluate 
its mean queueing delay of a virtual queue that is used to model a congestion effect that occurs when 
many SUs intend to share the same PUs' channel. By using the renewal theory, we provide a queueing 
delay analysis based only on statistical information of the PUs' activities and SUs' service time. Finally, 
we examine equilibrium dynamics through iterative algorithms and provide a sufficient condition for 
equilibrium convergence. In terms of the operators, we devise two pricing mechanisms. For the case of 
the operator being a commercial planner, we propose a revenue-optimal pricing policy to maximize the 
operator's revenue by solving a convex optimization problem. For the case of the operator being a social 
planner, we propose a socially-optimal pricing policy to maximize network social welfare by solving an 
equivalent convex problem. 

In the second monopoly market, each entering delay-sensitive SU is allocated a dedicated channel from 
a list of channels managed by an exclusive-use operator. Similarly to the first monopoly market, we also 
study the effect of the operator's pricing on the equilibrium behaviors of the non-cooperative SUs. Since 
queueing delays of the SUs' jobs are equal to their service time due to these dedicated channels, the 
analytic results of the SU's equilibrium behaviors, revenue-optimal pricing and socially-optimal pricing 
can also be obtained in this market. 

In the third duopoly market, we treat the competition between shared-use and exclusive-use operators 
as a two-stage Stackelberg game, where two operators aim to maximize their revenues through pricing 
and each SU will make its decision of which operators to join based on the prices charged by the 
two operators. We first show that there exists a unique equilibrium for this game. We then explore the 
equilibrium dynamics via the iterative algorithms and provide a sufficient condition for the equilibrium 
convergence. 

The rest of this paper is organized as follows. Related work is reviewed in Section [TTJ and a system 



model is described in Section III We analyze the shared-use monopoly in Section IV the exclusive- 
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use monopoly in Section |Vj and the duopoly market in Section VI Numerical studies are provided in 



Section VII| Finally, we conclude our work in Section VIII 



II. Related Work 

There is an increasing interest in the three-tier model of the dynamic spectrum market, which consists 
of interactions among spectrum owners, operators, and SUs |6j-[10|. However, the model of price-based 



spectrum access control between operators and multiple SUs has attracted the interests. The studies in 
pT|-p3| examined cases in which multiple operators compete for SUs, whereas 1 14 1 considered multiple 



SUs competing to access operators' channels. The authors of fl3| considered short-term spectrum trading 
across multiple PUs and SUs. [16] considered interactions between an operator and multiple SUs. 

Nevertheless, most of these works focused on operators' pricing and the responses of SUs via their 
demand functions (e.g. the SUs' required bandwidth). In this paper, we focus on the pricing mechanisms 
and their impact on the equilibrium behaviors of SUs in a queueing system, which can be traced back 
from the original work of JT7), |18| and surveyed in the monograph p9| . To that end, there are many 



works which attempt to balance between observable and unobservable queueing models. The observable 



queue system [20]— 1 22 1 requires either a centralized control server or a feedback mechanism with time 
overhead. In contrast, we use the unobservable queue system, which appropriately models the non- 
cooperative and distributed nature of SUs where SUs have no information about each other. Recent work 
on this paradigm of CR applications used server vacations or breakdowns in their queueing systems to 



model the opportunistic shared-use approach [20 1, [23|. In |20|, a queue was centrally controlled so that 



the current queue length could be observed for the SU decision making process. This work also used a 
discrete-time model where all distributions of arrivals and services were simply limited to a Bernoulli 



distribution in order to facilitate analysis. The work in [23] used the unobservable and continuous-time 
models; however, the inter-arrival times and services were restricted to the exponential distributions for 
ease of analysis. Both papers also assumed homogeneous SUs and the Markovian channels to model the 
PU traffic. Our work not only considers the heterogeneous SUs in terms of delay sensitivity, but also 
provides a queueing model where the PUs' channels and the service distributions of the SUs can be 
general. 

III. System Model 

We assume that there are two wireless network operators providing different DSA models. The first 
operator, denoted by 0\, uses the shared-use model, whereas the second operator, denoted by O2, employs 
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the exclusive-use model. We consider a network that consists of either one of the operators (i.e. monopoly) 
or both of them (i.e. duopoly), which corresponds to three types of secondary markets (cf. Fig. [2]). A 
stream of SUs is assumed to arrive at the network and each of them will make a decision as to whether or 
not to join an operator (in the case of a monopoly) or which operator to join (in the case of a duopoly). 



A. SUs 

We proceed to describe important parameters of the SUs. 

1 ) Arrival Rates and Service Time: We assume that the SUs arrive at the network according to a Poisson 
process with rate A. Each SU is associated with a distinct job (e.g. a packet, session, or connection) that 
it carries upon arrival. The service time to complete a job is represented by a random variable X with a 
probability density function (pdf) fx{x). This service time is assumed to be independent of the arrival 
process. 

2) Delay-Sensitive User Types: Since the SUs are assumed to carry delay-sensitive traffic, each job 
is attached to a specific application type characterized by a parameter 9. This parameter represents an 
individual preference that reflects the delay sensitivity of the SU's application. The value of 9 varies across 
job types, capturing SUs' heterogeneity. Individual values of 9 are private, but their cumulative distribution 
function, denoted by Fq(.), is known. We also assume that this parameter follows a uniform distribution 



on [0, 6 up ], which is common in the literature [24|-|26|. The relationship between 9 and application types 



is presented through some examples: many multimedia applications with stringent delay requirements will 
have high values of 9; on the other hand, applications with 9 equal to zero are insensitive to delay. 

3) Individual Utilities: The value 9 of a SU is realized at the instant it arrives (not before). This so- 
called type-0 SU then must make a decision: either join the network or balk. The utility of any balking 
SU is set to zero. For a type-# SU that joins operator Oi, its utility function is given by 

Ui(9) = V - 9di - a, z = 1,2. (1) 



This utility function, which is widely used in the literature [25], (27J, captures the balance between a 
reward V and a total cost Odi + q that a SU undertakes once it decides to join the system. The reward V, 
which is assumed to be independent of SUs' application type, represents a benefit of a SU for accessing 



the service [27 1. The total cost consists of two elements: the admission price a charged by Oi (i.e. the 
SUs are price-takers), and the waiting cost 9d{ of a job that spends a delay di. In this waiting cost, 
the parameter 9 can be interpreted as a waiting cost per unit time, an interpretation that adheres to the 
delay- sensitivity mindset of 9: a higher waiting cost per unit time induces more negative effects of the 
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Fig. 2. Three secondary-market scenarios: (a) Shared-use Monopoly, (b) Exclusive-use Monopoly and (c) Duopoly. 



delay, which shows more sensitivity to delay. We also assume that the unit of is chosen such that 8di 
has the same unit of V — c%. 

B. Shared-Use Operator (0\) 

The operator 0\ is assumed to own a single channel. This channel is licensed to legacy PUs, and is 
shared opportunistically by multiple SUs based on an admission price charged by 0\. 

1) PUs: Traffic patterns of PUs on the licensed band can be modelled as an ON-OFF renewal process 
alternating between ON (busy) and OFF (idle) periods. We model the sojourn times of the ON and OFF 
periods as i.i.d. random variables Y and Z, with the pdf /y(y) and fz(z), respectively. We assume that 
the ON and OFF periods are independent with SUs' arrival process and service time. 

This ON-OFF process can be considered a channel model for the SU services. This model captures 
the idle time period in which the SUs can utilize the channel without causing harmful interference to the 
PUs. We note that this PU traffic model is more general than other Markov ON-OFF models in which 



both busy and idle periods are restricted to the exponential distributions [20|, |23|, [28|. 

2) Steady-State Virtual Queue: Since many SUs may attempt to share the same licensed channel, 
congestion can occur, which will clearly affect the delay of each SU job. Therefore, when a SU job 
arrives, it will evaluate its job's delay in a queue containing other SU jobs that also wish to use that 
licensed channel. This queue is only a virtual queue because each SU cannot observe how many other 
jobs are waiting before its job (since SUs cannot know how many other SUs are trying to share the 
licensed channel). Therefore, each SU forms a virtual queue based on the statistical information of A, 
fx(x), fy(y) and fz(z), which are assumed to be estimated by existing methods | |30[ , to assess the 
mean queueing delay that its newly arrived job incurs. There are also many proposals in the CR literature 



which use the concept of virtual queue to model the congestion effect [23 1, |29| in different contexts 
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Henceforth, we simply use "queue" to refer to this virtual queue. We can consider 0\ to be an M/G/l 



queueing system (cf. Fig. 2a I whose service time has a general distribution dictated by fx(x), fy(y) 
and fz{z) since a SU service occuring in OFF periods can be interrupted by the returning of PUs in 
ON periods. We denote E [T(A)] the mean steady-state queueing delay (i.e. waiting time + service time) 
induced by an arrival rate A. From ([1]), the utility of a type-# SU with 0\ is 

U 1 (9) = V -9E[T(\)}- Cl . (2) 

C. Exclusive-Use Operator (O2) 

The operator O2 is assumed to obtain (i.e. via leasing) the part of spectrum which is temporarily 
unused by the spectrum owner. This spectrum chunk is divided into multiple bands that have the same 
bandwidth as the single band of 0\. Since there is no PU traffic on these bands, SU services are not 
interrupted in this case. 

Whenever an arriving SU decides to join O2, the operator allocates a dedicated channel for the SU. 
We assume that O2 always has enough dedicated bands to serve the SUf] Therefore, we can consider 



02 to be an M/G/oo queueing system (cf . Fig. 2b 1 where queueing delays of all SUs are equal to E [X] . 
From {j}, the utility of a type-6> SU with 2 is 

U 2 {6) = V-6E[X] -c 2 . (3) 

IV. Type I: Shared-Use Monopoly Market 

In this section, we first investigate the SUs' strategies with the mean queueing delay analysis, the 
Nash equilibrium, and the equilibrium convergence. We then examine the 0\s optimal pricing policies 
in terms of the revenue and social maximization. 

A. SUs' Strategies 

1) Nash Equilibrium: We consider a stream of self-optimizing arriving SUs, which are concerned 
only with their own benefits. In the game theory context, the potential SUs behave like players in a 
non-cooperative game, and the decisions regarding joining or balking are their strategies. Specifically, 
upon arrival, each type-6> SU has to make a decision based on the joining probability p(8) € [0, 1]. Given 
the joining rule {p(9), 9 > 0}, the unconditional probability that a potential SU joins the monopoly Oi 

3 This assumption can be relaxed by "borrowing" more channels from other homogeneous operators when O2 lacks the 
dedicated channels (4). 
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is pi = J °° p(0)(1Fq(6). With this joining rule, the actual arrival rate to the system is Xpi. Because A 
is fixed, we denote queueing delay by E [T (pi)] rather than E [T(Api)] for ease of presentation. 

Since SUs are self-optimizing, each type-# SU will choose its joining probability p(6) to maximize its 
expected utility p{9)U\{9) + (1 —p(6))0, which corresponds to the following individual optimal strategy. 

Definition 1. An individually-optimizing type-6> SU that has U\{6) = V — #E [T(j?i)] — c\ will join 0\ 

• with probability p{9) = 1 if U\{9) > 0, which requires 

9 < 6 1 (p 1 ), where 9 1 ( Pl ) 4 ^'^y (4) 

• with probability p{9) = 0, otherwise. 

Condition ([4]) states that when all SUs employ the individual optimal strategy, only a fraction of SUs 
that have 9 values less than 9\{pi) will join the 0\. Since the unconditional joining probability p\ can 

be considered to be the fraction of SUs that join 0\, we have 

roo rdi(pi) 
pi=/ P (9)dF e (9)= dF @ (9) = F e (9 l (p 1 )). (5) 

J o Jo 

Therefore, the equilibrium of the SUs' joining probability to 0\ is defined as follows. 
Definition 2. p\ is a Nash equilibrium of SUs' joining probability in a shared-use monopoly if it satisfies 

p* 1 = F {9 1 (pl)). (6) 

This definition shows that once reaching an equilibrium, the fraction of joining SUs remains the same 
hereafter. The equilibrium p\ is called a Nash equilibrium if at this point, no SU has any incentive to 
deviate from its strategy assuming that all other SUs continue to follow their strategies. The following 
theorem establishes the existence and uniqueness of the Nash equilibrium, of which the proof is provided 
in Appendix [A] 

Theorem 1. For a given admission price c±, there exists a unique Nash equilibrium of the SUs' joining 
probability p\ in a shared-use monopoly market. 

2) Queueing Delay Analysis: In order to perform its individual optimal strategy, each SU must estimate 
the mean queueing delay, which will be analyzed in the sequel. 

We assume that a SU can use its spectrum sensing and handoff capabilities to detect and protect 
the PUs, respectively. Spectrum sensing is used to inform the SU whether the channel is busy or idle. 
Moreover, sensing errors are assumed to be negligible. When the channel is sensed to be idle, the SU job 
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Fig. 3. A sample ON-OFF process with a realization of an effective service time X e where its original SU service time is 
increased because of three interruptions from ON periods Yi, Y% and Y3. 



can be in service. When the channel is sensed to be busy, the spectrum handoff procedure is performed 
to return the channel to the PUs. This kind of the listen-before-talk channel access scheme has been 



adopted in the quiet period technique of the IEEE 802.22 standard [31 1. During the service time of a SU 
job, it is likely that the SU must perform multiple spectrum handoffs due to multiple interruptions from 
the returns of PUs, represented by ON periods. Spectrum handoffs, which are employed to protect PU 
traffic and to provide reliable SU services, help SUs vacate the channel during ON periods and resume 
their unfinished services after ON periods end. Clearly, in the case of multiple spectrum handoffs, the 
original service time X of the SU job is increased, as illustrated in Fig. |3j and this increased service 
time is called the effective service time and is denoted by a random variable X e . 

We begin the analysis by denoting E [W(pi)] the mean waiting time in the M/G/l queueing system 
whose mean service time and arrival rate are E [X e ] and Api, respectively. According to the Pollaczek- 
Khinchin formula [32], the mean waiting time is 



Using the mean value analysis in |32|, we have the extended- value mean queueing delay as follows 



E[W( Pl )] + E[X e ], if X Pl <l/E[X e ]; 
V[T( P1 )} = { (8) 

00, otherwise. 

This extended- value queueing delay can eliminate the explicit condition Xp\ < 1/ E [X e ] in our arguments 
hereafter. The problem boils down to how to derive E [X e ] and E [Xf\ , the first and second moments 
of the effective service time, respectively, in order to estimate the mean queueing delay. We proceed to 
use the renewal theory to derive these moments based on the statistical information of the SU service 
time and the ON-OFF process. 

The First Moment of Effective Service Time. Defining a random variable, N(X), as the number of 
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Fig. 4. Mean queueing delay performance comparison: (a) heavy PU traffic model, (b) light PU traffic model. 



renewals (i.e. ON periods) occurring in the interval (0, X), we have 



E [X e ] = E 



* — 'i=l 



E [X] + E 



E [X] + E [E] E [iV(X)] , (9) 



where the final equality occurs because E is independent of X. From (33] pp. 45], we have 



E[Z]- 



E [N(X) \ X = x] 

As a consequence, E [N(X)] = which is then substituted into ([9]) so as to obtain 

E[y]' 



E[X e ]=E[X]^ + E[z]; . 
The Second Moment of Effective Service Time. We continue with 



E[* e 2 



E 



(10) 



(11) 



(12) 



Using the law of iterated expectations, the second term of the right side of ( 12 1 can be shown to be 



E 



N(X) 



E I X = x 



x E [N(x)Y] = x E [TV (a;)] E [Y] = x : 



; E[E] 

B[zy 



(13) 



where we have the second equality because of the independence between Y and N(x). Hence, we obtain 



E 



N(X) 



^-—'i=i 



e 



E [X 2 



EJE] 
E[Z] 



(14) 



Next, we derive the third term on the right side of ( [12] ) as follows 

2" 



E 



N(X) 



1=1 



E 



E 



i=l 



+ E 



E [N(X)} E [E 2 ] + E [E] 2 E [(N(X) - l)N(X)] 



(15) 
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We define g(X \ X = x) = E [(N(X) - l)N(X) \ X = x] and denote the Laplace transform of an 
arbitrary function f(x) by f*(s). Using the similar technique of deriving the variance of the number of 
renewals in (33} pp. 55], we can easily obtain the following result 

9{s > = s ^ lz]1 -r z (sy <l6) 

An inverse Laplace transform can then be applied to g*(s) so as to obtain g{x). Therefore, g(X) can be 
found correspondingly. From (12 1, ( 14i, and ( p"5) ), we can see that E [Xl] is completely derived. 



Examples with Analysis and Simulation Comparisons. We supplement the queueing delay analysis 
through a performance comparison of the analysis and simulation by the following five examples. 

(a) X, Y and Z all have the exponential distributions with fx{x) = nxe~ flxX , /y(y) = /ione _MonS/ and 
fz{z) = /i ffe _Moff2 , respectively. This combination is termed Exp, and we obtain 

1 ( - [loft \ 



E[X e ] = — 1 + (17) 
EM^ + ^ + ^ + l. (18) 

^on^X f^on^X Hon^x f^X 

(b) X, Y and Z all have the Erlang distributions with fx(x) = ^ 2 x xe~^ xX , fy(y) = [^ nV e ~^ mV an d 
fz{z) = ^ ff ze~ MoffZ , respectively. This combination is termed Erl, and we obtain 

E[X e ] = ^(l + t * off ), (19) 
^[^21 6^ ofF 12/Xpff 6 8/zg ff (3/j fF + 2/i X ) ,_ m 

11/ A. = —5 1 7T H 5- H S 7T-7- 777 ■ (zU; 

/^on/** fijf /i^ n /i^(2/i ofF + ^X) 2 

(c) X is uniformly distributed on [xi Q ,a; U p], whereas Y and Z have the exponential distributions with 
fyiu) = Hoi\e.~^°" v and /z(-z) = ii ^e~^ MZ , respectively. This example is termed UniExp, and we 
obtain 

E[Xe] = ^lo+^up f 1 + b*\ ( 2i) 

E [Xl] = V ( 22 ) 

48/u of f^ n (a; U p - x\ ) 

where iV = (a^ p _ xf G )(4^ ff + 16/^ ff/ion + 16^ ffMon) + ^lff{x 2 up ~ xf Q ) + 6fi off {x up - x {o ) + 

(d) X has the Erlang distribution with fx(x) = fi x xe~ flxX , whereas Y and Z have the exponential 
distributions with /y(y) = ^one~ Mon2/ and fz{z) = /ioffe~ MoffZ , respectively. This combination is 
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termed ErIExp, and we obtain 

E [X e ] = — (l + H*L\ (23) 

fJ>X V ^on/ 

Era=^% + ^° fL + 12 ^ L + ^- (24) 

KnVx Von^X Von^x H X 

(e) X has the exponential distribution with f x (x) = fj,xe~^ xX , whereas Y and Z have the Erlang 
distributions with fy(y) = \^ nV e ~ ~^°" y anc ^ fz{z) = n^f f ze' fl °" z , respectively. This combination is 
termed ExpErl, and we obtain 

1 A MofA 



E[X e ] = — 1 + 1 (25) 

V /"on / 

E[l e 2 ] =T ^ + ^«* +J**_ + (2 6 ) 

In order to validate our queueing analysis, we simulate a single-server queue with service interruptions 
for the performance comparison. In all five examples, we fix A = 1 and vary p\ to adjust the traffic load 
into the queue. We set \i x = 1 f° r both Exp and Erl, fj,x = 1-5 for ErIExp, jj, x = 2/3 for ExpErl and 
[^lo^up] = [0.1, 1.9] for UniExp. The comparison between the analysis and the simulation is presented 
in Fig. [4] in two scenarios: the left figure shows the results of the setting /i on = 0.5 and /i Q ff = 1.5, 
which represents a heavy PU traffic model in urban areas, whereas the right figure shows the results of 
the setting fi on = 1.5 and fi ff = 0.5, which represents a light PU traffic model in rural areas. Despite the 
variation in numerical settings, Fig. [4] shows that the analysis results are very similar to the simulation 
results. 

3) Equilibrium Convergence: We focus on the algorithm and condition for the convergence of the 
equilibrium joining probability p\. We assume that the system operates over successive time periods 

labeled t = 0, 1, 2 The arrival rate is Xp\ during a period t, which is assumed to last long enough for 

the system to attain the steady state. Since the mean queueing delay varies with the joining probability 
of the SUs, each type-# SU will make a joining decision in the next time instant t + 1 by forming a 
prediction of the queueing delay denoted by T t+1 . Hence, this SU will join the network at period t + 1 
if and only if V > 9T t+l + c\. One of possible prediction techniques is every SU expects that the mean 
queueing delay in the next period is equal to that in the current period: T t+1 = E [T(p*)l. Defining 
liPl) — ^©(0l(pi))> we describe the dynamics of SUs' joining probability via two iterative algorithms, 
namely static expectations and adaptive expectations J34J, of which the SUs' joining probability evolves 
as follows, respectively 

p\ +l = q(p\) (27) 
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and 

p\ +1 = {l-a)p\+aq(p\), (28) 

where a G (0, 1]. We can see that the adaptive method is reduced to the static method when a = 1. The 
static method is also called the naive expectations method p4| because it assumes that each SU ignores 
similar actions of the others. In order to alleviate shortcomings of the static expectations, the adaptive 
expectations method - with the intuition that only a fraction of SUs make decisions to change at a given 
time - allows SUs to learn from and correct for past errors. We obtain the following result which is 
proved in Appendix [B] 

Theorem 2. With any starting point p\ 6 [0, 1] and an a a (0, 1], the sufficient condition for the 
equilibrium convergence of the SUs' joining probability dynamics in \2%\ is 



ffli < I (29) 

If A < 1/ E [X e ] , we can always find a sufficiently small a such that the convergence of the equilibrium 
is guaranteed globally. If A > 1/ E [X e ] , this condition is always violated because the left side of p9] l 
goes to oo. However, this is a sufficient condition, which does not imply that the equilibrium diverges 
when it is violated. It may still converge locally when the starting point is in a neighborhood of the 
equilibrium. We will illustrate this effect in the numerical section. 

B. Optimal Pricing Mechanisms of 0\ 

The main focus of this subsection is the operator's use of pricing as a way to maximize its revenue 
as well as the social welfare of the CR system. 

1 ) Revenue-Optimal Pricing: When charging a price c\, 0\ can attain an equilibrium revenue R\{c\) = 
c\p\{c\), where p\(c\) is the equilibrium at price c\ defined in Q. The problem of finding the revenue- 
optimal price that maximizes d's equilibrium revenue can then be expressed as 

max. Ri(ci). (30) 

Cie[ci„,e up ] 

Based on ([6]), (|7]>, and ([8), we obtain 

0, if ci > c up , 



if ci < c lo , (31) 



v / n+E[x e ](A(y-c 1 )+e up ) otherwise 

A0 up (2E[X e ] 2 -E[JQ]) ' ° tnerW1Se ' 
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where 



c up = V, 

C| = max{0,T/-# up E[T(l)]}, 

n = 2A^ up (y - ci) e [x 2 ] + e [x e ] 2 (a(ci - f) + <? up ) 2 . 



(32) 
(33) 
(34) 



From ( |3T| ), we can see that R\(c{) is a concave function; hence, the solution of the problem (30 1 
can be solved efficiently. When 0\ uses this c\ for admission pricing and all SUs employ the individual 
optimal strategy, the corresponding equilibrium joining probability will be p*(c^). 

2) Socially-Optimal Pricing: Network social welfare is considered to be the aggregate utility obtained 
by all SUs. When 0\ charges a price c\, at the equilibrium, only the SUs with 6 < 9\(jp\{c\)) that join 
the CR network have positive utilities according to ([4]). Therefore, the network social welfare at price c\ 
is expressed as follows 

SM)= {V-9E[T(p$( Cl ))})dF e (9), (35) 

Jo 

where 9\(p\{ci)) is the cut-off SU at price c\. The socially-optimal pricing problem can then be cast as 

max. Si(ci). (36) 

cie[cfa,Cup] 

However, solving this problem is difficult due to the complex functions 9\(p\{c\)) and p\{c\). Observing 
that 

Wi(ci)) 



Pt( ci ) = F e {9 1 (pt(c 1 ))) 



(37) 



'up 



we instead change the choice of variable from c\ to a cut-off SU variable denoted by 9\. Then the new 
objective function is 



S x {h) = J o (V-9E[t (9x/9 up j\j dF e {9) = V ( 
Hence, an equivalent maximization problem of ( [36] ) is as follows 

max. 5i(^i). 

ei>o 



up 



20 U p 



E 



T[0 x /0 up 



(38) 



(39) 



We can observe that Si{9\) is concave in its domain; hence, the solution of ( |39| ), denoted by 9\, can be 
solved efficiently. Then, from Q, the socially-optimal price cf can be calculated as 



4 



V 



■E 



T(9f/9 up ) 



(40) 



When 0\ uses this c\ for admission pricing and all SUs employ the individual optimal strategy, the 
corresponding equilibrium joining probability will be pKcf). 
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V. Type II: Exclusive-Use Monopoly Market 

In this section, we first investigate the SUs' strategies and the Nash equilibrium. We then examine the 
2 s optimal pricing policies in terms of the revenue and social maximization. 

A. SUs' Strategies 



The self-optimizing behaviors of SUs are similar to those in Section IV where the strategy of each 



type-6> SU is characterized by its joining probability p(9). Given the joining rule {p(0), 9 > 0}, the 
unconditional probability that a potential arriving SU joins the monopoly O2 is P2 = / °° p(6)cIFq(9). 
Since a self-optimizing type-6> SU will choose p(9) to maximize its expected utility p(9)U2{9) + (1 — 
p(9))0, we have the individual optimal strategy of a SU in an exclusive-use monopoly as follows. 

Definition 3. An individually-optimizing type-0 SU that has U~2{9) = V — 9 E [X] — C2 will join O2 

• with probability p{9) = 1 if U2{9) > 0, which requires 

9<9* 2 , where 9* 2 4 ^ZJ* (41) 

• with probability p{9) = 0, otherwise. 

Therefore, the equilibrium of the SUs' joining probability to O2 is defined as follows. 

Definition 4. p 2 is a Nash equilibrium of SUs' joining probability in an exclusive-use monopoly if it 
satisfies 

POO P62 

p* 2 = p(9)dF e (9)= dF e (9) = F e (9* 2 ). (42) 
J Jo 

It is clear that, for a given admission price c 2 , there exists a unique Nash equilibrium of the SUs' joining 
probability p 2 . 



B. Optimal Pricing Mechanisms of 2 

1) Revenue-Optimal Pricing: When charging a price 02, O2 can attain an equilibrium revenue i?2(c2) — 



C2p\{c2), where p\{c2) is the equilibrium at price C2 found in (42 1. It is clear that the revenue-optimal 

price of O2 is c? = ¥, which is the solution of the problem max. R 2 (c 2 ). 

2 c 2 e[o,v) 
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2) Socially-Optimal Pricing: The network social welfare is expressed as follows 

*2 



S,(c 2 )= r 2iC2 \v-eE[X]) d F e (e) = v e -^ -%^E[X], (43) 

JO "up ^"up 



where Q\ (02) is a cut-off SU at the price 02 according to (41 1. It is clear that the socially-optimal price 

of 02 is Co = 0, which is the solution of the problem max. 82(02). 

c 2 e[o,v] 

VI. Type III: Duopoly Market 

In this section, we consider a duopoly market in a CR network where both 0\ and O2 compete with 
each other in terms of pricing in order to maximize their revenues. Based on the prices set by the two 
operators, a #-type SU will decide either to join one or to balk to maximize its utility. This duopoly model 



is illustrated in Fig. 2c The relationship between operators and SUs can be seen as a leader-follower 
game that can be studied using the two-stage Stackelberg game. Specifically, the operators are the leaders 
that simultaneously set the prices in Stage I, then SUs will make the joining decisions in Stage II. 

A. Backward Induction for the Two-Stage Game 

We examine the subgame perfect equilibrium of this Stackelberg game by using a common approach: 



the backward induction method (33J, [36]. The equilibrium behaviors of the SUs in Stage II will be 
analyzed first. Then, we investigate how operators determine their prices in Stage I based on the SUs' 
equilibrium behaviors. 

1) SUs' Strategies in Stage II: In this stage, when two operators are present in the network and set 
the prices (01,02), each type-# SU upon arrival will have to choose one of three possible options: join 

01, join 02, or join neither. We denote px(c\, C2) and £2(01,02) as the fraction of SUs that join 0i and 

02, respectively. Henceforth, we simply use the notation p\ and p 2 . We also define an indifference SU 
as follows 

»W * EFr£)i-EW <44 > 

Each SU is assumed to be a rational decision maker in that it only chooses one operator to join if its 
utility with this operator is both positive and higher than that with the other operator, which corresponds 
to the following individual optimal strategy. 

Definition 5. An individually-optimizing type-0 SU that has U\{6) = V — 9E[T (pi)] - c\ with 0\ and 

U 2 {6) = V - 6E [X] - c 2 with 2 , where E7i(0(pi)) = U 2 (0{ Pl )), will join 
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O x with probability p{9) = 1 if Ui{6) > U 2 {9) and Ui{6) > 0, which requires 

9 < 9{ Pl ) and 9 <9 1 (p 1 ), (45) 
02 with probability p(0) = 1 if U 2 {9) > Ui{6) and U 2 (0) > 0, which requires 

S(pi) <e<ei, (46) 

neither with probability p{9) = 1 if f7i(0) < and C/2 < 0, which requires 

9 > 0i (pi) and 6> > 0J. (47) 



Recall that ^i(pi) and 9\ are given in Q and ( |4T| ), respectively. Based on ([45]), (j46|> and ([47]», the 
unconditional joining probabilities (pi,P2) are as follows 

/ /.minfOxCpO.flCpi)} /-e; \ 

(pi,p 2 ) = / iFe(e), /_ dF e (0) . (48) 



Je( Pl ) 



With the time slot model as in Subsection IV-A3 and according to ( [48] ), the SUs' joining probability 



dynamics in the duopoly can be described as follows 

(a) If 9{p\) < 0, which leads to c 2 < c\ since E [T (p\)] > E [X] , \/t, then 0\ is eliminated from the 
competition, leaving O2 as a monopoly. We have 



{p\ +1 ,P2 +1 ) = (0, F e (9* 2 ))yt. (49) 
(b) If < 9{p\) < 9* 2 , which leads to 9% < 9 1 (p\), then we have 

{p\ + \ P2 +1 ) = (M S (ti))i F ® ( *2) ~ MHPl)))- (5°) 



(c) If 9(p\) > 9 2, which leads to 9\ > 9i(p\), then O2 is eliminated from the competition, leaving (D\ 
as a monopoly. We have 

(p t 1 +1 ,P t 2 +1 ) = (Fe(9 1 (p{)),0). (51) 
Since there exists a sufficiently small e and the corresponding p e = min{l, AE | X 1 — e} such that 

p\<p e yt, (52) 
we define an equilibrium (p*,p 2 ) in a duopoly market with the given prices (c\,c 2 ) as follows. 
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Definition 6. Given a sufficient small e, (Pi,p|) * s a Nash equilibrium of SUs' joining probability in a 
duopoly market if it satisfies 



Q,F e (9* 2 )), if0(p e )<O, 

li>lfi) = \ Oe F e (0 2 *)-F e (%!))), if < 0(p<) < 0|, (53) 

Fe(^i(pl)), o), if0(p e )>0|. 
We have the following result which is proved in Appendix |Cj 

Theorem 3. For a given admission price pair (ci,C2), there exists a unique Nash equilibrium of the 
SUs' joining probability (p*,^) satisfying ( |53| ) in a duopoly market. 



We can see that both #(p e ) < and 9(p e ) > Q\ correspond to the equilibrium behaviors of the shared- 
use and exclusive-use monopolies in Section [IV] and Section [Vj respectively. Therefore, we will focus 
only in the case < 6(p e ) < 9Z, in what follows. 

Regarding the convergence of the equilibrium {p\,p2), the static expectations method is presented in 



( |50| ). The adaptive expectations method can also be presented as follows 

p*+i = (l-a)p* + aqd (p*), (54) 

where a G (0, 1], p* = (p\, p^) an d 

Qd(p') = (Fe{e( P \)), F e (9* 2 ) - F e (S{p\))). (55) 

We obtain the following result which is proved in Appendix |D| 

Theorem 4. With any starting point (p?,^) ^ [^) I] 2 an d an a (0, 1], f/ie sufficient condition for the 
equilibrium convergence of the SUs' joining probability dynamics ( |54[ ) is 

E' [T(l)l 1 

' J < -. (56) 



E[T(1)]-E[X] a' 

Since the left side of ( [56] ) is strictly greater than that of ( [29] ), we see that the equilibrium convergence 
condition of a duopoly in ([56]) is more stringent than that of the shared-use monopoly in ( |29] >. 

2) Price Competition in Stage I: In this stage, the operators determine their pricing strategies based 
on {p\,p%) in Stage II. Given a pair of prices (ci, 02), the equilibrium revenue of the operator i is 

Ri{ci,c 2 ) = Cip*, i = 1,2. (57) 
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Here, (p*,^) * s gi ven i n (53 1 in the case of the duopoly coexistence with < 9(p e ) < 9%, which 
corresponds to the condition 

ci < c 2 < n(ci), (58) 

where 

( , a F(E[r(p*)]-E[jr])+E[jr] Cl 

U(ci) = — =; T=r, — rr • (59) 

The competition between two operators in Stage I can then be modelled as the following game 
. Players: Ox and 2 , 

• Strategy: 0\ chooses price c\ G (0, C2); O2 chooses price C2 G (0, u(ci)), 

• Payoff function: Ri(c\,C2), i = 1,2. 

We denote the Stage I game equilibrium by (c*,c|), and define pf = p*(cf,c|), i = 1,2. 

Theorem 5. There exists a unique Nash equilibrium of a Stage I game such that 

( V(E [T(pf)]-E[X]) 2V(E [T(pf)]-E[X}) \ 
ll52j ^ 4E[T(pf)] -E[X] ' 4E[T(pf)] - E [X] J ^ J 

OT/ 

= —= with A = E [X e ] 2 (XV - 36U) 2 + 8^ P AF E [X 2 1 . (61) 

The proof of Theorem [5] is given in Appendix [e] We then examine whether (c^,c^) satisfies the 
condition ( |58] l or not. Since the lower bound of ( [58] ) is clearly satisfied, we check the upper bound 
condition c* 2 < u(c\), which is equivalent to the following inequality after some algebra manipulations 

2E [T(pf)] E pV)] > E [X] (3E [T(pf )] - E [T(p%. (62) 

It turns out that ([62} is true since E [T(p e )} > E [X] and E [T(p e )} > E [T(pf )] . 



B. Equilibrium Summary 

We summarize all equilibrium cases in Table [I] Since O2 has dedicated channels for SUs and provides 
less delay than that of 0\, it is intuitive that O2 becomes a monopolist when c\ > c%. However, if C2's 
price is much higher than Oi's in case of C2 > u(c\), then 0\ becomes a monopolist. Only in the case 
of ci < C2 < u(c\), both operators can share the market and the unique subgame perfect equilibrium 
of the Stackelberg game is ((c^c^), (pf,pf ))- To attain this equilibrium, the operators first update the 
statistical information of SUs and PUs to calculate {c\,d^} according to ([60j> and broadcast. Based on 
these prices, the SUs then employ the individual optimal strategy, inducing the result (pf,pf). 
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TABLE I 

Equilibrium Summary 



Pricing space 


Cl > C 2 


Cl < C2 < m(ci) 


C2 > u(ci) 


Equilibrium 


O2 monopoly Nash equilibrium 

(Pi.pS) = (0, F e (05)) 


Duopoly subgame perfect 
equilibrium ((<:*, Ca), (pf,p®)) 


d monopoly Nash equilibrium 

(pi,pS) = (fe(0i(Pi)), 0) 




Cl 



Fig. 5. d's revenues (top plot) and SUs' equilibrium joining probabilities (bottom plot) as functions of the admission price. 



VII. Numerical Results 

In this section, we apply the analysis results to numerically illustrate the SUs' equilibrium behaviors 
and the optimal pricing strategies in the 0\ monopoly market first, and then the interaction between 0\ 
and 02 in the duopoly market. To facilitate the illustration, the parameter settings adhere to the following 
order of Exp, Erl, UniExp, ErIExp and ExpErl examples with light PU traffic model (i.e. [i on = 1.5 and 



A*off = 0.5) in Section IV-A2 Furthermore, we set V = 1, 6 up = 1, a = 0.3, and e = 0.01. 



A. Shared-Use Monopoly 

1 ) Revenue Optimization: The top part of Fig. [5] shows graphs of 0\ 's revenue with respect to price c± . 
We can see that the revenue functions have convex forms and their maximum values are achieved nearly 
at the same price (0.58) with the corresponding revenues 0.13,0.07,0.14,0.1 and 0.08 with respect to 
the order of the example settings. At c\ = and c% = V = 1, all revenues are zero, which is clear due to 
the revenue function and the individual optimal strategy definitions. The equilibrium joining probability 
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Fig. 6. The network social welfare as functions of the cut-off SUs. 

0.5 
0.4 
^0.3 
0.2 
0.1 

2 4 6 8 10 12 14 16 18 5 10 15 20 25 30 
iterations iterations 

Fig. 7. The convergence of equilibrium joining probabilities p*(ci) and p*(cf) with static expectations (left plot) and adaptive 
expectations (right plot). 

p\{ci) is plotted in the bottom part of Fig. [5] At the price cf = 0.58, we can see that the corresponding 
Pi(cf) are 0.21,0.11,0.23,0.17 and 0.14 with respect to the order of example settings. This plot also 
demonstrates that when c% is increased, p\(c\) is decreased. 

2) Social Optimization: The network social welfare as a function of cut-off user 9\ is shown in 
Fig. [H] It can be seen that the socially-optimal cut-off SUs 9f are 0.3,0.16,0.33,0.25 and 0.21, and the 
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Fig. 8. Pi( c i) an d Pi( c i) converge locally (left plot) and globally (right plot) when the condition in (29) is violated and 
satisfied, respectively. 



corresponding socially-optimal values Si(6f) are 0.19,0.1,0.21,0.16 and 0.13 with respect to the order 
of the example settings. The respective socially-optimal prices cf can be calculated according to (|40i. 



Compared with the bottom plot of Fig. [5j these prices map correctly with the corresponding values of 
Pi(cf), which is equal to Of since 6 up = 1. 

3 ) Equilibrium Convergence Dynamics: With the starting point p\ set to zero, the equilibrium conver- 
gence of all settings using static and adaptive expectations are illustrated in Fig. [7] Although the condition 



in (29 1 is violated in all five settings (i.e. A > 1/ E [X e ]), it can still be seen that all joining probabilities 
converge to the expected equilibrium points presented previously as Theorem [2] gives a sufficient but not 
necessary condition. 



In Fig. [8j we examine the local and global convergence when the condition in ( |29| ) is violated and 
satisfied, respectively. With the Exp setting, the left plot of Fig. [8] shows that the equilibria p*(c^) and 
Pi(cf ) can converge if we choose the starting points in the range of [0, 0.75]. We observe that if the starting 
point is larger than 0.76, the divergence occurs. The right plot of Fig. [8] shows the global convergence 
of the equilibria p*(c±) and p\(cf) in a new setting that is the same as the Exp setting except that the 



parameter fix is changed to 2, inducing the condition in ( |29| ) is satisfied with a = 0.3 
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B. Duopoly 

We continue to illustrate the price competition between 0\ and 02. We first consider the effect of the 
Stage Fs equilibrium (c*, c|) on the Stage IPs equilibrium ip{,p%)- From ( [60] ), the equilibrium values are 
( c *,c*) = (0.13, 0.26), (0.19, 0.38), (0.11, 0.22), (0.16, 0.32) and (0.18,0.36) with respect to the order of 
example settings. With these equilibrium prices, the corresponding (p^prj) convergence of all settings is 



shown in Fig. 9a 



We next illustrate the tendency of a duopoly to form a monopoly if the condition in (58 1 is violated. 



We first choose a price pair that is close to the upper bound of this condition, where c\ = = 0.58 



and C2 = 0.99 u(c\). Fig. 9b shows that with this price pair, the p\ of all settings converges closely to 0, 
whereas p\ converges closely to the equilibrium in the 0\ monopoly shown in Fig. [5] We then choose 
a price pair that is close to the lower bound of the condition in ([58]), where C2 is set arbitrarily to 0.5 



and ci = 0.99 C2- With this price pair, as shown in Fig. 9c the p\ of all settings converges closely to 
0, whereas p\ of all settings converges closely to Fq> (#2) = gpjp the equilibrium joining probability of 
the 02 monopoly 

The data shown in Fig. [9] not only validate our analysis of the price competition of a duopoly, but also 
provide the convergence behaviors of different methods. In Fig. [9] all the graphs in the left column show 
the convergence with the adaptive expectations method, whereas the graphs in the right column show the 
convergence with the static expectations method. 

VIII. Conclusion 

This paper describes the price-based spectrum access control between the operators and SUs in three 
market scenarios. The interactions between the first monopolist operator with shared-use DSA and delay- 
sensitive SUs is examined through a queueing analysis representing the SU's congestion due to the shared 
single channel. We show that there exists a unique Nash equilibrium in a non-cooperative game where 
the SUs are the players employing individual optimal strategies for spectrum access. We also provide a 
sufficient condition and the iterative algorithms for the equilibrium convergence. The pricing mechanisms 
of the operators are also considered for two problems of revenue and social welfare maximization. The 
second monopolist operator using exclusive-use DSA has many channels to dedicate to SUs. Owing to the 
separate channels, the analysis of the interactions between the operator and the SUs is straightforward, 
yet provides useful insights in the third market analysis. In the third duopoly market, we study the 
price competition between two operators using shared-use and exclusive use DSA. We formulate this 
competition as a two-stage Stackelberg game. The equilibrium behaviors of the SUs in Stage II are 
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analyzed first, and we then examine how operators determine their prices in Stage I based on the SUs' 
behaviors. Using the backward induction method, we show that there exists a unique equilibrium in this 
game and investigate the equilibrium convergence. 

Appendix A 
Proof of TheoremQ] 

We first show the existence and uniqueness of the equilibrium. Defining 3>(pi) = Fq (0i(pi))— P\ with 
Pi G [0, 1], we can see that &(pi) is a strictly decreasing function because F®(.) is an increasing function 
and 0i (pi) is a strictly decreasing function (since E [T(pi)] is strictly increasing) on their domains. By 
Definition [2j pi is an equilibrium if and only if it is a root of $(pi). Hence, it suffices to show that 
&(pi) has a unique root on its domain as follows. 
When V < c\, we clearly see that p* = is the unique root of &(pi). 
When V > ci + 9 up ~E [T(l)], we clearly see that p* = 1 is the unique root of 3>(pi). 
When ci < V < c\ + E [T(l)] 9 up , we have two following cases 
(a) There exists a p' x E (0, 1) such that 

Cl < V = ci + E [r(pi)] # up < ci + E [T(l)] up , (63) 

since E [T(pi)] is a strictly increasing function. Then we observe that 

$0l) = l-pi >0, VpiGlO,^], (64) 
$(l) = F (0i(l))-KO. (65) 



With ( |64| ), (65 1 and the fact that 3>(pi) is a continuous and strictly decreasing function, we see that 
$(pi) has a unique root p* £ (p' 1: 1) in this case, 
(b) If there does not exist any p' satisfying ( [63] ), then we have c\ < V < c\ +E [T(pj)] 9 up , Vpi € [0, 1]. 



We observe that 

$(O) = F©(0i(O))>O, (66) 
$(l)=F e (0i(l))-l<O. (67) 



With ( |66l ), (67 1 and the fact that 3>(pi) is a continuous and strictly decreasing function, we see that 
<J?(pi) has a unique root p^ G (0, 1) in this case. 
Next, we show that this unique root is a Nash equilibrium. When all SUs experience the same mean 
queueing delay E [T(p^)], pi will be a Nash equilibrium if no SU of any type 9 can increase its utility 
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by choosing an entrance probability different from p(9) in Definition [T] To see this, consider a specific 
type-0 SU 

(a) If V > 9E[T(p* 1 )\ + ci, according to Definition [T] this SU will join with probability p(9) = 1; 
hence, its expected utility is V — 9~E[T(p*)] — c\ > 0. If this SU deviates from this individual 
optimal strategy by choosing another strategy < p{9) < 1, it will receive an expected utility 
p(9)(V -6~E[T(j)* 1 )]+ci) < V -9E[T(pl)] -c 1 . Therefore, such a SU has no incentive to deviate 
from its current strategy. Conversely, if such a SU chooses another strategy p{9) < 1, it will find 
that it can increase its expected utility by switching to p{9) = 1. 

(b) If V < 0E [T(p*)] + ci, by deviating from the individual optimal strategy (i.e. p(9) = 0), the SU 
will receive a strictly smaller expected utility; hence, this SU has no incentive to deviate from its 
current strategy. 

(c) If V = #E [T(p|)] + ci, by deviating from the individual optimal strategy, the expected utility of 
this SU will still be zero; hence, this SU also has no incentive to deviate from its current strategy. 



Appendix B 
Proof of Theorem[2] 

Since q(p\) : [0, 1] h-» [0, 1] is differentiable, according to the contraction mapping (37) , the equilibrium 
can be achieved and is stable for any starting point p\ G [0, 1] if the following condition is satisfied 



aW{ Pl )\ < 1, VpiG[0,l]. 



(68) 



With q{pi) = Fq(9i(pi)), we have 

E' [T( P1 ] 



l'(Pi 



-fe(9i(pi))9 1 ( Pl 



E [T(pi 



< 



max fe(9)9 

ee[o,9 uP ] 



E ; [T( P1 ] 

max — : — ; — - 
Pl6 [0,l] E[T(p0 



E ; [T(l)] 
E[T(1)] 



(69) 



where the third equality can be determined because maxg 6 p,0 up ] f&(9) 9 = 1 and e[t(^j ^ s a positive 
and increasing function, which attains the maximum value at the upper boundary point p\ = 1. From 



(68 1 and (69 1, we complete the proof. 



Appendix C 
Proof of Theorem[3] 

If 9{p e ) < 0, from ((49]), the unique Nash equilibrium is (0, Fq (#2)) corresponding to the case of 
exclusive-use monopoly analyzed in Section |V] 
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If 9(p e ) > 9 2, since we have 9(p) > 9(p e ) > 0f, Vi according to ( |52| ), the unique Nash equilibrium 



IV 



is (Fe (#i(f>i)) , 0) corresponding to the case of shared-use monopoly analyzed in Section 

If < 9(p e ) < 6*2, we first focus on the existence and uniqueness of an equilibrium as in Definition |6j 
We define fi(pi) = F e [9 (pi)) ~Pi for Pi £ [0, 1]. We can see that Q(pi) is a strictly decreasing function 
because F®(.) and 9(p\) are increasing and strictly decreasing functions, respectively, on their domains. 
By Definition |6j p\ is an equilibrium if and only if it is a root of Sl(pi). Hence, it suffices to show that 
fJ(Pi) has a unique root on its domain. Based on the fact that 

Q(0) = F e (0(0)) > 0, (70) 

Q(l) = F e (0(1)) - K 0, (71) 

and fi(pi) is a continuous and strictly decreasing function, we see that f2(pi) has a unique root p* £ (0,1). 
Since p| = Fe (0 2 ) — Fq (0(p*)) only depends on the unique p*, it is clear that (p*,p 2 ) * s a un iq ue 
equilibrium of Definition [6] We next present that this (p*,p 2 ) * s a Nash equilibrium by showing that at 
this point, no SU of any type 9 can increase its utility by deviating from the individual optimal strategy 
in Definition [5] We note that 9(p\) < 9\ because if 9(p\) > 0|, (51 1 shows a contradiction. Therefore, 
with < 9(p*) < 9\, we consider a specific type-0 SU 

(a) If < 9 < 9{p\) < 91, we have V - 9 E [T (pi)} - a > V - 9 E [X] - c 2 . Moreover, 9(p\) < 9* 2 
leads to 9(p\) < 0i(p*), which shows that V - 0E[T(p*)] - c\ > 0. Therefore, this SU has no 
intention to deviate from the individual optimal strategy defined in Definition [5] 

(b) If < 9(p\) <9 <9* 2 , we have V - 0E [T (pf)] - Ci < F - 0E [X] - c 2 and V - 0E [X] - c 2 > 
0. Therefore, this SU has no intention to deviate from the individual optimal strategy defined in 
Definition [5] 

Appendix D 
Proof of Theorem^] 

Since qd(p) : [0, l] 2 >->■ [0, l] 2 is differentiable, according to the contraction mapping |37|, the 
equilibrium can be achieved and is stable for any initial (p?,p 2 ) if tne following condition is satisfied 

a||q' d (p)|| < 1, V P G[0,1] 2 , (72) 

where q^(p) is the Jacobian matrix of qd(p) and |.| is a matrix norm. Using the oo-norm, < 72 1 is 
equivalent to 

a\F'e(9(p x ))\ <1, Vpi€[0,l]. (73) 
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We have 

|*e(0(Pi))| 



-/e (0(pi)) 9(pi 



E' [T(pi 



E[T( Pl )]-E[X] 



< 



max / e (0)0 

ee[o,e up ] 

E' [T(l)] 



max 



E'[T(pi)] 



E[T(1)]-E[X]' 



From (73 1 and ( 74 1, we complete the proof. 



Pie[0 ,i]E[T( Pl )]-E[X] 

(74) 



Appendix E 
Proof of Theorem[5] 



The revenue of 0\ is 

Ri(c l) c 2 ) = c l p\ = c 1 F e {6{p\)) 



ci(c 2 - Cl) 



6 up (E[T(pt)]-B[X})- 



Maximizing above with respect to c\ by setting = 0, we obtain the best response of 0\ 



BRi(c 2 ) 



<"2 



Similarly, the revenue of 2 is 



R 2 (ci, c 2 ) = c 2 p* 2 = c 2 (F e (6»5) - F e (%!))) = c 2 



V — c 2 c 2 — ci 

9 up E[X]~ 9 up (B[T(pt)]-B[X}) 



Maximizing above with respect to c 2 by setting = 0, we obtain the best response of 2 

RR , , F(E [r(^)]-E[X])+ Cl E[X] 
^ 2(Cl) " 2E[T(fi)] • 

The Nash equilibrium strategy profile can be computed using the intersection of the best responses of 
both operators as follows 

V(E [T(pi)]-E[X]) 



(75) 



(76) 



(77) 



(78) 



c\ = BRx{BR 2 {c\)) 



4E[T(p\)]-E[X] ' 



(79) 
(80) 



Substituting (|79]> and (jSOJ into ( |53) ), we obtain 

® 
Pi 



V 



(? up (4E[T(pf)]-E[X]) 



(81) 



Using ([8]), the solution of the fixed-point equation ( [81] ) can be found and is equal to ( [61] ) 
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Fig. 9. Convergence behaviors of the duopoly equilibrium (pi,P2) with three different price pair settings. 
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